Predicting the Triple Beta-Spiral Fold from Primary Sequence Data
Authors
Abstract
The Triple β-Spiral is a novel protein structure that plays a role in viral attachment and pathogenesis. At present, there are two Triple β-Spiral structures with solved crystallographic coordinates: one from Adenovirus and the other from Reovirus. There is evidence that the fold also occurs in Bacteriophage SF6. In this thesis, we present a computational analysis of the Triple β-Spiral fold. Our goal is to discover new instances of the fold in protein sequence databases. In Chapter 2, we present a series of sequence-based methods for the discovery of the fold. The final method in this chapter is an iterative profile-based search that outperforms existing sequence-based algorithms. In Chapter 3, we introduce specific knowledge of the protein's structure into our prediction algorithms. Although this additional information does not improve the profile-based methods in Chapter 2, it does provide insight into the important forces that drive the Triple β-Spiral folding process. In Chapter 4, we employ logistic regression to integrate the score information from the previous chapter into a single unified framework. This framework outperforms all previous methods in cross-validation tests. We do not discover a great number of additional instances of the Triple β-Spiral fold outside of the Adenovirus and Reovirus families. The results of our profile-based templates and score integration tools, however, suggest that these methods might well succeed for other protein structures.
Thesis Supervisor: Bonnie A. Berger
Title: Professor of Applied Mathematics
Thesis Supervisor: Roy E. Welsch
Title: Professor of Statistics and Management
Similar Resources
Expression of TGF-β3 in Isolated Fibroblasts from Foreskin
Background: The multifunctional transforming growth factor beta (TGF-β) is a glycoprotein that exists in three isoforms. TGF-β3 expression increases in fetal wound healing and reduces fibronectin and collagen I and III deposition, and also improves the architecture of the neodermis which is a combination of blood vessels and connective tissue during wound healing. Fibroblasts are key ...
A statistically derived parameterization for the collagen triple-helix.
The triple-helix is a unique secondary structural motif found primarily within the collagens. In collagen, it is a homo- or hetero-tripeptide with a repeating primary sequence of (Gly-X-Y)(n), displaying characteristic peptide backbone dihedral angles. Studies of bulk collagen fibrils indicate that the triple-helix must be a highly repetitive secondary structure, with very specific constraints....
Design and Analysis of Ultra-wide Band Bandpass Filter Using Spiral Stub-Loaded Triple-Mode Resonator with a Notched Band
An ultra-wide band band-pass filter using novel spiral stub-loaded triple-mode resonator (SSLTMR) is presented. New spiral stub loaded resonator is analyzed with odd and even modes analysis for this class of BPF, achieving higher band wide and size reduction. In order to have a good response characterized, two (SSL-TMRs) and two quarter wavelength digital coupled lines are used. This new design...
Phylogenetic Analysis of Beta-Glucanase Producing Actinomycetes Strain TBG-CH22 - A Comparison of Conventional and Molecular Morphometric Approach
Actinomycetes are inexhaustible producers of commercially valuable metabolites, are continually screened for beneficial compounds. The taxonomic and phylogenetic study of novel actinomycetes strains are mostly based on conventional methods and primary DNA structure of 16s rRNA. Although 16s rRNA sequence is well accepted in phylogeny studies, its secondary structures have not been widely used. ...
Conceptual framework for performing simultaneous fold and sequence optimization in multi-scale protein modeling
We present a dual optimization concept of predicting optimal sequences as well as optimal folds of off-lattice protein models in the context of multi-scale modeling. We validate the utility of the recently introduced hidden-force Monte Carlo optimization algorithm by finding significantly lower energy folds for minimalist and detailed protein models than previously reported. Further, we also fi...